Reproducibility: The New Frontier in AI Governance

Mason-Williams, Israel, Mason-Williams, Gabryel

arXiv.org Artificial Intelligence

AI policymakers are responsible for delivering effective governance mechanisms that can provide safe, aligned and trustworthy AI development. However, the information environment offered to policymakers is characterised by an unnecessarily low signal-to-noise ratio, favouring regulatory capture and creating deep uncertainty and divides over which risks should be prioritised from a governance perspective. We posit that current publication speeds in AI, combined with the lack of strong scientific standards in the form of weak reproducibility protocols, effectively erode the power of policymakers to enact meaningful policy and governance protocols. Our paper outlines how AI research could adopt stricter reproducibility guidelines to assist governance endeavours and improve consensus on the AI risk landscape. We evaluate the forthcoming reproducibility crisis within AI research through the lens of crises in other scientific domains, providing a commentary on how adopting preregistration, increased statistical power and the publication of negative results can enable effective AI governance. While we maintain that AI governance must be reactive due to AI's significant societal implications, we argue that policymakers and governments must consider reproducibility protocols a core tool in the governance arsenal and demand higher standards for AI research. Code to replicate data and figures: https://github.com/IFMW01/reproducibility-the-new-frontier-in-ai-governance
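The paper's call for increased statistical power can be made concrete with a back-of-the-envelope sample-size calculation. The sketch below uses the standard normal approximation for a two-sided, two-sample comparison; the function name and default thresholds are illustrative choices, not taken from the paper.

```python
import math
from statistics import NormalDist

def required_sample_size(effect_size: float, alpha: float = 0.05, power: float = 0.8) -> int:
    """Per-group sample size for a two-sided two-sample test (normal approximation)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)  # critical value for the significance level
    z_beta = z.inv_cdf(power)           # critical value for the desired power
    n = 2 * ((z_alpha + z_beta) / effect_size) ** 2
    return math.ceil(n)

# A "medium" effect (Cohen's d = 0.5) at 80% power needs ~63 runs per condition;
# a small effect (d = 0.2) needs ~393 — far more seeds than most ML papers report.
print(required_sample_size(0.5))
print(required_sample_size(0.2))
```

The point of the sketch: detecting small effect sizes, typical of incremental benchmark gains, requires hundreds of independent runs per condition, which is why underpowered single-seed comparisons feed the noise the paper describes.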


Confronting the Reproducibility Crisis: A Case Study in Validating Certified Robustness

Moulton, Richard H., McCully, Gary A., Hastings, John D.

arXiv.org Artificial Intelligence

Reproducibility is a cornerstone of scientific research, enabling validation, extension, and progress. However, the rapidly evolving nature of software and dependencies poses significant challenges to reproducing research results, particularly in fields like adversarial robustness for deep neural networks, where complex codebases and specialized toolkits are utilized. This paper presents a case study of attempting to validate the results on certified adversarial robustness in "SoK: Certified Robustness for Deep Neural Networks" using the VeriGauge toolkit. Despite following the documented methodology, numerous software and hardware compatibility issues were encountered, including outdated or unavailable dependencies, version conflicts, and driver incompatibilities. While a subset of the original results could be run, key findings related to the empirical robust accuracy of various verification methods proved elusive due to these technical obstacles, as well as slight discrepancies in the test results. This practical experience sheds light on the reproducibility crisis afflicting adversarial robustness research, where a lack of reproducibility threatens scientific integrity and hinders progress. The paper discusses the broader implications of this crisis, proposing potential solutions such as containerization, software preservation, and comprehensive documentation practices. Furthermore, it highlights the need for collaboration and standardization efforts within the research community to develop robust frameworks for reproducible research. By addressing the reproducibility crisis head-on, this work aims to contribute to the ongoing discourse on scientific reproducibility and advocate for best practices that ensure the reliability and validity of research findings within not only adversarial robustness, but security and technology research as a whole.
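One of the documentation practices the paper advocates, recording the exact software environment alongside results, can be sketched in a few lines of standard-library Python. The manifest fields below are an illustrative minimum (the paper does not prescribe a schema); in practice this would sit next to containerization, not replace it.

```python
import json
import platform
import sys

def environment_manifest() -> dict:
    """Capture interpreter and OS details a later reproduction attempt can diff against."""
    return {
        "python_version": sys.version.split()[0],
        "implementation": platform.python_implementation(),
        "os": platform.system(),
        "machine": platform.machine(),
    }

# Written alongside experimental results, e.g. results/manifest.json.
print(json.dumps(environment_manifest(), indent=2))
```

Had such a manifest shipped with the original VeriGauge results, the version conflicts and driver incompatibilities described above could at least have been diagnosed against a known-good baseline.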


Reproducibility in Machine Learning-Driven Research

Semmelrock, Harald, Kopeinik, Simone, Theiler, Dieter, Ross-Hellauer, Tony, Kowald, Dominik

arXiv.org Artificial Intelligence

Research is facing a reproducibility crisis, in which the results and findings of many studies are difficult or even impossible to reproduce. This also holds in machine learning (ML) and artificial intelligence (AI) research, often due to unpublished data and/or source code, and due to sensitivity to ML training conditions. Although different solutions to address this issue are discussed in the research community, such as using ML platforms, the level of reproducibility in ML-driven research is not increasing substantially. Therefore, in this mini survey, we review the literature on reproducibility in ML-driven research with three main aims: (i) reflect on the current state of ML reproducibility in various research fields, (ii) identify reproducibility issues and barriers that exist in research fields applying ML, and (iii) identify potential drivers, such as tools, practices, and interventions, that support ML reproducibility. With this, we hope to inform decisions on the viability of different solutions for supporting ML reproducibility.


Last Week in AI #180: Meta's troubled chat bot, AI in femtech, Science AI's reproducibility crises, and more!

#artificialintelligence

Did Meta not learn anything from Microsoft's infamous chatbot Tay? On August 5, Meta released BlenderBot 3, an AI chatbot, to users in the US. As Meta warned, BlenderBot indeed was "likely to make untrue or offensive statements": it described Mark Zuckerberg as "too creepy and manipulative" to a reporter from Insider and claimed Trump was still president and "always will be" to a Wall Street Journal reporter. Users can flag BlenderBot's inappropriate and offensive responses, and Meta claims it has reduced offensive responses by 90 percent. Our Take: Color me amused and not surprised.


Sloppy Use of Machine Learning Is Causing a 'Reproducibility Crisis' in Science

#artificialintelligence

Machine learning involves feeding an algorithm data from the past that tunes it to operate on future, unseen data.
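That one-sentence definition can be made concrete with a toy chronological split: tune on past observations, then apply the tuned rule to later, unseen ones. Everything below is an illustrative stand-in, not an example from the article.

```python
# Chronological split: fit on past observations, evaluate on later, unseen ones.
observations = [(x, 2 * x) for x in range(10)]  # toy series with a known rule y = 2x
train, test = observations[:8], observations[8:]

# "Training": estimate the slope from past data (mean of y/x, skipping x = 0).
nonzero = [(x, y) for x, y in train if x != 0]
slope = sum(y / x for x, y in nonzero) / len(nonzero)

# "Inference": apply the tuned rule to future, unseen inputs.
predictions = [slope * x for x, _ in test]
print(predictions)  # [16.0, 18.0] — matches the held-out targets, as the rule is exact
```

The sloppiness the headline refers to typically enters exactly at this boundary: when information from the "future" half leaks into the tuning step.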


Sloppy Use of Machine Learning is Causing a 'Reproducibility Crisis' in Science

WIRED

History shows civil wars to be among the messiest, most horrifying of human affairs. So Princeton professor Arvind Narayanan and his PhD student Sayash Kapoor got suspicious last year when they discovered a strand of political science research claiming to predict when a civil war will break out with more than 90 percent accuracy, thanks to artificial intelligence. A series of papers described astonishing results from using machine learning, the technique beloved by tech giants that underpins modern AI. Applying it to data such as a country's gross domestic product and unemployment rate was said to beat more conventional statistical methods at predicting the outbreak of civil war by almost 20 percentage points. Yet when the Princeton researchers looked more closely, many of the results turned out to be a mirage.


Could machine learning fuel a reproducibility crisis in science?

#artificialintelligence

A CT scan of a tumor in human lungs; researchers are experimenting with AI algorithms that can spot early signs of the disease (credit: K. H. Fung/SPL). From biomedicine to political sciences, researchers increasingly use machine learning as a tool to make predictions on the basis of patterns in their data. But the claims in many such studies are likely to be overblown, according to a pair of researchers at Princeton University in New Jersey. They want to sound an alarm about what they call a "brewing reproducibility crisis" in machine-learning-based sciences. Machine learning is being sold as a tool that researchers can learn in a few hours and use by themselves -- and many follow that advice, says Sayash Kapoor, a machine-learning researcher at Princeton.


Leakage and the Reproducibility Crisis in ML-based Science

Kapoor, Sayash, Narayanan, Arvind

arXiv.org Artificial Intelligence

The use of machine learning (ML) methods for prediction and forecasting has become widespread across the quantitative sciences. However, there are many known methodological pitfalls, including data leakage, in ML-based science. In this paper, we systematically investigate reproducibility issues in ML-based science. We show that data leakage is indeed a widespread problem and has led to severe reproducibility failures. Specifically, through a survey of literature in research communities that adopted ML methods, we find 17 fields where errors have been found, collectively affecting 329 papers and in some cases leading to wildly overoptimistic conclusions. Based on our survey, we present a fine-grained taxonomy of 8 types of leakage that range from textbook errors to open research problems. We argue for fundamental methodological changes to ML-based science so that cases of leakage can be caught before publication. To that end, we propose model info sheets for reporting scientific claims based on ML models that would address all types of leakage identified in our survey. To investigate the impact of reproducibility errors and the efficacy of model info sheets, we undertake a reproducibility study in a field where complex ML models are believed to vastly outperform older statistical models such as Logistic Regression (LR): civil war prediction. We find that all papers claiming the superior performance of complex ML models compared to LR models fail to reproduce due to data leakage, and complex ML models don't perform substantively better than decades-old LR models. While none of these errors could have been caught by reading the papers, model info sheets would enable the detection of leakage in each case.
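One leakage type from the paper's taxonomy, duplicate records landing on both sides of the train/test split, can be illustrated with a deliberately stylized, standard-library sketch. The 1-NN scorer and toy data below are hypothetical, not the paper's code or its civil-war benchmark.

```python
def nearest_neighbor_accuracy(train, test):
    """Score a memorizing 1-NN classifier: predict the label of the closest train point."""
    correct = 0
    for x, y in test:
        _, predicted = min(train, key=lambda row: abs(row[0] - x))
        correct += predicted == y
    return correct / len(test)

# Ten records whose alternating labels carry no smooth signal a neighbor could exploit.
records = [(i, i % 2) for i in range(10)]

# Honest protocol: deduplicate first, then split into disjoint train and test sets.
honest = nearest_neighbor_accuracy(records[:8], records[8:])

# Leaky protocol: the dataset contains duplicates, and the split puts one copy of
# every record in train and the other in test (taxonomy type: duplicates).
doubled = records + records
leaky = nearest_neighbor_accuracy(doubled[:10], doubled[10:])

print(honest, leaky)  # 0.5 1.0 — leakage makes a memorizing model look perfect
```

This is the mechanism behind the "wildly overoptimistic conclusions" above: the leaky split rewards memorization, so the reported score says nothing about performance on genuinely new data.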


GitHub Is Bad for AI: Solving ML Reproducibility - DZone AI

#artificialintelligence

There is a crisis in machine learning that is preventing the field from progressing as fast as it could. It stems from a broader predicament surrounding reproducibility that impacts scientific research in general. A Nature survey of 1,500 scientists revealed that 70% of researchers have tried and failed to reproduce another scientist's experiments, and over 50% have failed to reproduce their own work. Reproducibility, also called replicability, is a core principle of the scientific method and helps ensure the results of a given study aren't a one-off occurrence but instead represent a replicable observation. In computer science, reproducibility has a narrower definition: any results should be documented by making all data and code available so that the computations can be executed again with the same results.
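That narrower, computational definition — rerun the same code on the same data and get the same results — hinges on controlling nondeterminism. A minimal sketch, assuming the only source of nondeterminism is the random number generator (real ML pipelines also contend with GPU kernels, threading, and library versions):

```python
import random

def run_experiment(seed: int) -> list[float]:
    """A stand-in 'experiment' whose only nondeterminism is the RNG."""
    rng = random.Random(seed)  # a seeded, private RNG: no hidden global state
    return [round(rng.gauss(0, 1), 6) for _ in range(5)]

first = run_experiment(seed=42)
second = run_experiment(seed=42)
print(first == second)  # True: same code, same data, same seed -> same results
```

Publishing the seed alongside the code and data is what turns "we ran it again" into "the computations can be executed again with the same results".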